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1 Introduction 

The most widely known result of Chevalley-Warning type states that if one 
has a polynomial over a finite field of characteristic p, and the number of 
variables exceeds the degree, then the number of zeros is a multiple of p. In 
particular, if the polynomial is homogeneous, there is at least one non-trivial 
zero. 

There are a number of related results in the literature. Generally, let ¥ q 
be a finite field, and let f = (/i(x), . . . , / r (x)) be an r-tuple of polynomials 
fi(xx, ■ ■ ■ ,x n ) G Fjxi, . . . ,£„]. Let di be the total degree of and write 
d = di + . . . + d r . For any subset 5 C F" we put 



and 



z(f ; S) = z(S) :={xeS : f (x) = 0} 



jV(f; S)=M(S):=#z(f;S). 



Then the original 1936 result of Chevalley [2] stated that if n > d and z(W™) 
is non-empty, then it contains at least 2 points. Immediately afterwards it 
was shown by Warning [3] that if n > d then 

p I JVTO, (i) 

where p is the characteristic of ¥ q . In fact this followed from a trivial re- 
arrangement of Chevalley' s argument. In the same paper Warning proved 
that if n > d, and if Hi and H2 are parallel affine hyperplanes in F" then 

M{H X ) =N{H 2 ) (modp). (2) 

As a corollary he deduced that if z(¥ q ) is non-empty then 

A/TO > q n ~ d (n>d). (3) 
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Later, in 1964, Ax [TJ strengthened ([T]) by showing that indeed one has 

g|A/-(F") (n>d). (4) 

When n is large compared with d there are further improvements possible, 
as was shown by Katz [4] for example. These sharpenings of (J4J) lead to 
improvements of all the results we will establish. However we wish to focus 
here on the situation in which n is not much larger than d, and so will be 
content with using (jlj). 

Our first result shows that there is a strengthening of ([2]) corresponding to 
Ax's improvement of ([T]) . To stress that the affine linear spaces we encounter 
are not necessarily vector subspaces of F™ we will work over A n (¥ q ), but of 
course the sets z(A n (¥ q )) and z(F%) are the same. 

Theorem 1 With the notation above, we have 

N(Lx) = N(L 2 ) (mod q) (5) 
for any two parallel linear spaces Li,L 2 C A n (Fg) of dimension d or more. 

Indeed our argument shows how one may deduce (j^D from (J4J). 

We next look at improvements to (J3]). If K — ¥ q k has field basis ui, . . . ,u k 
over F = ¥ q we set 

N k (xt, ...,x k ) := N K/F (x 1 u 1 + . . . + x k u k ). (6) 

This produces a form of degree k in k variables, with the property that 
Nk(xi, . . . , Xk) = with variables Xi G F q only when the Xi are all zero. In 
particular, if f consists of a single polynomial f\ = Nd then z(f; A n (F g )) will 
be a linear space of dimension n — d, and we will have equality in ([3]). In 
contrast we have the following results. 

Theorem 2 Suppose that n > d and that z(A n (¥ q )) is non-empty, and is 
not a linear subspace of A n (¥ q ). Then 

(i) For any q we have J\f(A n (¥ q )) > q n ~ d j - 

(it) Ifq>Awe haveAf(A n (¥ q )) > 2q n ~ d ; and 

(Hi) For any q we have Af(A n (¥ q )) > q n+l ~ d /{n + 2 — d) providing that the 
polynomials f are homogeneous. 
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Our proof of this will use only combinatorial facts about A n (¥ q ), along with 
(jSJ) and its consequence (jlj). Thus the analogues of these results would be 
true if one replaced z(A n (F q )) by any set with the property ([5]) for every pair 
of parallel d- dimensional linear spaces Ai, A2 C A n (¥ q ). In particular we see 
that part (ii) is best possible in this sense, since (j^J) would hold if z(A n (¥ q )) 
were composed of two parallel linear spaces of dimension n — d. 

It may be instructive to examine two simple examples at this point. The 
first shows that one cannot hope to improve part (iii) of Theorem [2] to say 
that Af(A n (¥ q )) > q n + l ~ d _ Here we take f to consist of the single polynomial 
Q(xi, . . . , X4)A" n _ 4 (x5, . . . , x n ). Here A" n _ 4 is a norm form of degree n — 4, as 
in (jf)]), and 

Q(x l} X 2 , X 3 , 2 4 ) = X X X 2 + £3 + X3X4 + Cx\ 

is a quadratic form, with the coefficient c G F 9 chosen so that x\ + X3X4 + cx\ 
does not factor over W q . Then Q = has precisely q 3 — q 2 + q solutions 
(xi, . . . , 24) G F^, and since d = n — 2 in this case we have 

Af{f ; A n (F,)) = - g" 1 + g" 2 ) < g n+1 ~ d . 

Indeed this also shows that part (ii) of the theorem cannot be extended to 
3 = 2. 

Our second example shows that when f consists of a single polynomial 
/1, it is possible for z(A n (¥ q )) to be a linear space of dimension n — k even 
though /1 does not split into linear factors. For this we take n = 4 and choose 
a field basis l,a for ¥ q 2 over F 9 . Let and N' denote norm forms for F g 4 
over ¥ q 2 and ¥ q 2 over F 9 respectively, using the basis 1, a in the latter case. 
Let a be the nontrivial automorphism of ¥ q 2 over ¥ q . Then we may write 
AT(x) = Qi(x)+aiQ2(x) for certain quadratic forms Qi, Q2 over F g . It is clear 
that we may use a linear change of variables over ¥ q to write Q\ + aQ2 = Y1Y2 
and Qi + a a Q2 = Y3Y4 with independent variables Yi, . . . , Y4. It follows that 
if we take (3 G F 9 2 \ ¥ q different from a and a a then Qi + (3Q 2 = will have 
rank 4. Such a /3 certainly exists, except when q = 2. We now see that the 
polynomial /(x) = (Qi(x) + /3Q 2 (x))(Qi(x) + /3g 2 (x)) fJ is defined over F g 
and does not split into linear factors. None the less we have / = only for 
x = 0. 

Finally we will investigate the structure of z(i; A n (¥ q )) geometrically. 

Theorem 3 Given n and d there is a constant C(n,d) as follows. For any 
set of polynomials f whose degrees total d there is a corresponding set of ab- 
solutely irreducible varieties Vi, . . . , V m defined over¥ q such that z(f; A n (¥ q )) 
is precisely the set of ¥ q -points in V\ U . . . U V m . Moreover each variety Vi 
has degree at most C(n,d), and m < C(n,d). 
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The Lang- Weil bound [5] now gives us the following immediate corollary. 

Corollary Suppose that the largest dimension among the varieties Vi is D, 
and that this occurs k times. Then, enlarging C(n,d) if necessary, we have 

|jV(f; A n (F,)) — kq D \ < C(n, d)q D - 1 ' 2 . (7) 

Thus if q ^> n ,d 1 we have D > n — d, and indeed if z(f; A n (¥ q )) is not a linear 
space of dimension n — d then D > n + 1 — d. In the latter case we have 

Af(f; A n (F g )) > q n+1 - d - C(n, d)q n+1/2 - d 

for q > M 1. 

One would conjecture that D > n — d irrespective of the size of q, and it 
would be of interest to have a direct geometrical proof of this. Of course, 
the lower bounds on D follow on comparing (J2J) with ([3]) and part (iii) of 
Theorem [2j 

Acknowledgement This research was begun while the author was vis- 
iting the Institute for Advanced Study, in Princeton. The hospitality and 
financial support of the institute is gratefully acknowledged. 

We now present our proof of Theorem[TJ Given a polynomial f(x\, . . . , x n ) 
of total degree e there are two reasonable ways of associating a form to it. 
One may take f-(x\, . . . , x n ) to be the homogeneous part of degree e, or one 
may define 

f+(xo, . . . , X n ) = Xq/(Xi/ Xq, . . . , X n f Xo). 

For a system f we define f_ and f + by the above processes, using degree 
e = di for each polynomial /«. Clearly each zero of f produces exactly q — 1 
zeros of f + with x 7^ 0; and the zeros of f + with xq = correspond precisely 
to the zeros of f_. Thus 

A-(f + ;F^ +1 ) = ( g -l)Af(f;F-)+Af(f_;F£). 

In particular, if n > d then (j3J) yields q \ A/"(f+; F" +1 ) and hence 

Af(f;¥ n q )=Af(f^¥ n q ) (mod q). 

We therefore see that the value of ftf(f;W q ) modulo q only depends on the 
leading homogeneous parts of the polynomials f 1: . . . , f r . 

Now suppose we have parallel m-dimensional affine linear spaces Li and 
L2 in F™. We may represent the restriction of the forms f to L\ by a set of 

polynomials (g[ , . . . , g^) in m variables; and similarly for L2. Since L\ and 
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L 2 are parallel, the leading homogeneous parts of g± and will be the 
same. We are therefore led to Theorem [TJ 

Our proof of Theorem [2] is based on the following two lemmas. 

Lemma 1 Let L Q C A n (F g ) be a linear space. Choose a linear space L of 
maximal dimension, k say, such that L D Lq and M{L) = N(Lq). Suppose 
V D L is a linear space of dimension k + 1 such that M{L') is minimal. 
Then 

Af(A n (W q )) > Af(L) + q — (Af(L') - Af(L)). (8) 

Lemma 2 Let S C A*(F g ) be a set containing t+1 points in general position. 
Then 

(i) If q = 2, and if there is no 2-plane L C A*(F g ) meeting S in exactly 3 
points, then S = A t (¥ q ). 

(ii) If q > 3, and if £ C S for every line £ meeting S in at least two points, 
then S = A*(F 9 ). 

(in) If q > 4, and if #(S (1 £) > q — 1 /or eu ery line i meeting S in at least 
two points, then A*(F g ) \ S is contained in a hyperplane. 

(iv) If in > 2 is an integer, and if #(SC\£) > m + 1 for every line £ meeting 
S in at least two points, then 

m — 1 

To deduce Theorem [2] we assume that z(A n (¥ q )) contains a maximal set 
oft>n + l — d points in general position. We take A*(F 9 ) to be the space 
spanned by these points and apply Lemma [2] to the set S consisting of zeros 
of f lying in A*(F g ). We will give the details required for the different parts 
of Theorem [2] in due course, but the general strategy is as follows. Lemma H] 
will either provide a suitable lower bound for A/"(A n (F 9 )), or produce a linear 
space Lq to which we will apply Lemma [TJ This linear space will have di- 
mension two in the case of part (i) of Lemma El or dimension one otherwise. 
We will have arranged that Af(L ) < q — 1, and since N{L) = Af(L ) in 
Lemma [TJ the dimension k of L can be at most d, for otherwise (jSJ) will give 
us a contradiction. If k < d — 2 then flBJ) implies that 

ffi— k i .,n+2—d i 

A/YA n (F„)) > ^ > ^ >n+l-i 

q — 1 q — 1 



5 



which is a satisfactory lower bound. Thus we may assume that either L or 
V has dimension d. Let L\ = L or L' as appropriate. If H{L\) takes a 
value v < q — 1 then we may deduce from (j^J) that N{L*) > v for every affine 
(i-plane L* parallel to L±. Covering A n (F q ) with such linear spaces we deduce 
that J\f(A n (¥ q )) > vq n ~ d . Such bounds as these will suffice in all cases for 
the theorem. 

We begin by establishing part (i) of Theorem [2J From parts (i) and (ii) 
of Lemma|2]we may obtain a satisfactory bound N(A n (¥ q )) > q n+l ~ d unless 
either q = 2 and there is a 2-plane Lq C A n (F g ) with N(Lq) = 3, or q > 3 
and there is a line L C A n (F g ) with 2 < Af(L ) < q — 1. We now apply 
Lemma HJ As above we must have k < d. Moreover if k < d — 1 then (jSJ) 
yields 

jV r (A n (F,)) > ^ — - > q n ~\ 
which is satisfactory. If k = d and q = 2 then (jHJ) produces 

A/"(A n (Fq)) > 3 + 2 " 2 ^ ~ 1 > 2 n ~ d . 

Finally, if fc = d and g > 3 then 2 < A/"(L ) = J\f{L) < q - 1, and the 
argument above, with 2 < u < g — 1, shows that A/"(A ra (F g )) > 2q n ~ d . Thus 
A/"(A n (F 9 )) > q n ~ d in all cases. 

We turn now to part (ii) of Theorem |2j By part (iii) of Lemma [2] we 
have W(A n (Fq)) > g n+1 - d - g n ~ d > 2g n " d unless there is a line L C A n (F g ) 
with 2 < Af(L ) < q — 2. As above, when we apply Lemma [T] we may 
assume that k = d — 1 or d. If k = d then 2 < M{L) < q — 2 and we 
may apply (|5]) as before to conclude that A/"(A ra (F g )) > 2q n ~ d . Similarly if 
k — d — 1 and A/"(L') - JV(L) = 1 we have 3 < Af(L') < q - 1 and therefore 
A/"(A n (F 9 )) > 3q n ~ d . Finally, if fc = d - 1 and Af(L') —Af(L) > 2 we deduce 
from © that 

„n+l— d 1 

A/"(A n (Fq)) > 2^— — - — > 2g"" d . 

This establishes the required bound in all cases. 

To prove the third claim of Theorem [2] we apply Lemma [2] part (iv) with 
an integer m < q — 1 to be chosen in due course. Thus 

jy.n+2-d _ i 

AW(F„)) > > m n+1 - d 

m — 1 

unless there is a line with 2 < Af(L ) < m. In the latter case we deduce from 
Lemma [1] that 

Af(A n (¥ q ))> q -— r {M{L')-m), 
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and by the previous argument it suffices to consider the values k — d — 1 and 
k — d. If k — d — 1 we deduce that J\f(A n (¥ q )) > q n ~ d (q - m), unless 

m + 1 < N{Lf) <q-l. 

However, in the latter case we conclude by our standard argument using §5§ 
that J\f(A n (¥ q )) > (m + l)q n ~ d . In the homo geneous case the value k = d 
cannot occur. To show this we consider two cases. If k = d and 6 I then 
q — 1 | M{L) — 1, since if x e z(L) then every scalar multiple of x is also 
in z(L). This however is impossible since 2 < Af(L Q ) = < m. On 

the other hand, if (jL L we consider the (d + l)-dimensional linear space 
V := (L, 0). Here we find that Af(L') = 1 + (q - l)Af(L). According to ® 
we will have q \ Af(L'), whence N{L) = 1 (mod q). This again is impossible, 
since 2 < N(L) < m. It follows that one of the inequalities 



N(A n (W g )) > m 



n+l-d 



or 

N(A n (F q )) > q n - d (m + l), 

or 

A/"(A n (F 9 )) > (m + 
must hold. The required estimate now follows on choosing 



m 



q 



and noting that 



for any positive integer v. 



n + 2 - d 



l V l 



1 > 



v + 1 J ~ v + 1 



We now prove Lemmas [T] and [2j For the first of these it is enough to 
observe that A n (¥ q ) is the disjoint union of L together with the sets L* \ L, 
with L* running over all (fc+l)-dimensional linear spaces containing L. There 
are (q n ~ k — l)/(q — 1) such spaces L*. Thus the lemma follows from the fact 
that Af(L* \L)> Af(L') - N(L). 

Lemma [2] will require distinctly more work. We prove the four claims 
separately, using induction on t in each case. The various statements are all 
trivially true for t — and t — 1. To handle the induction step for case (i) 
we may assume that S contains a linear space L say, of dimension t — 1, 
together with a point P say, not on L . Now choose any point P ^ L with 
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P 7^ Pq, and aim to show that P G S. We will then be able to conclude 
that S = A*(F g ) as required. Let Pi be any point in Lq and consider the 
2-plane generated by P, Pq and P\. Since q — 2, this plane consists of the 
three generators together with a fourth point P2 say, which must belong to 
Lq. Our construction has ensured that P, Pi and P2 are all in S, and so by 
our hypothesis we also have Pi G S, as required. 

For part (ii) we use the same initial setup. We assume that S contains 
a linear space Lq of dimension t — 1, along with a point Pq not in Lq, and 
we choose a point P G" different from Pq. Suppose firstly that the line £ 
generated by Pq and P meets Lq, at a point Pi say. Then £ meets S in at 
least 2 points, namely Pq and P\. Then, by our hypothesis, £ is contained in 
S, whence P in particular belongs to S. This deals with all points P except 
those which lie on the hyperplane, Li say, which is parallel to Lq and which 
passes through Pq. To handle such points P we begin by fixing any point Pi 
on Lq. We then consider the line £ generated by P and P\. Since q > 3 this 
line contains at least one point additional point P2 say, which cannot lie in 
L\. Thus Pi G S by what has already been proved. Hence £ meets S in at 
least two points, namely Pi and P2, and our hypothesis implies that every 
point of £ belongs to S. In particular P G S, as required. 

The proof of part (iii) of Lemma [2] is the most involved. We will write 
S c for the complement A*(F ? ) \ S of S. Our strategy will be to show that 
if S° also contains t + 1 points in general position, then both #S > \q l 
and j^S c > \q l , which will provide a contradiction. We observe that the 
hypothesis of part (iii) is symmetric between S and S c , since S meets £ in at 
least two points if and only if #(S c n£) < q — 1. Now let R be either S or S c , 
and assume that R contains t + 1 points P , . . . , P t in general position. For 
our inductive assumption we suppose that, for any linear space L C A*(F ? ) 
of dimension t — 1, either R (1 L fails to contain t points in general position, 
or L \ R is contained in a proper linear subspace of L. Thus either L fl R 
or L n P c lies in a proper linear subspace of L. When L = Lq is generated 
by Pi, . . . , Pt we must be in the second case. For every P G Lq, the line £ 
generated by P and Pq meets R in at least two points (namely P and P ) 
and hence contains at least g — 1 points of R. For distinct choices of P the 
sets £\ {Pq} are disjoint, whence 

#P>i + (g-2)#(L nP). (9) 

Now suppose that every linear space L of dimension t — 1, parallel but not 
equal to Lq, has the property that L fl R lies in a proper linear subspace of 
L. Then since A*(F ? ) is a disjoint union of Lq with the various spaces L, we 
see that 

#P<#(£ nP) + (g-iy- 2 . 
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Comparing this with (J2J) yields (q — 3)#(L D R) < (q — l)q^ 2 , and since 
#(L PI R) > q l ~ l — q l ~ 2 we find that q < 4. Thus, under the assumption 
that g > 4, there must be at least one space L±, parallel but not equal to L , 
for which it is L\ fl R c which is contained in a proper linear subspace of L\. 
If we pick any point Q G L\ and count points of R on lines from Q to Lq 
we will obtain at least (q — 2)#(L fl R) points of R not lying on L x , by the 
argument that established (jH])- Allowing for points of L x fl R we find that 

#R>(q~ 2)#(L n i2) + n it"). 

However we have arranged that L fl i? c and fl i? c are both contained in 
proper linear subspaces, so that #(L PI R) > — q ,< ^ 2 , and similarly for 
n i?). It then follows that #R > (q - ifq^ 2 > \q\ since q > 4. As 
explained above, this inequality leads to the claim made in part (iii) of the 
lemma. 

Finally we turn to part (iv) of Lemma [2J For the induction step we 
assume that we have a linear space L of dimension t — 1, along with a point 
Pq not in L , such that 

frit — i 

#(SnL )> 

m—1 

and Pq G S. If P is any point in S fl L the line £ generated by P and P 
contains at least 2 points of S, and hence by our hypothesis contains at least 
m+1 such points. For different points P the sets £\{Pq} are disjoint, whence 

777^ — 1 ^oo^+l — 1 

#S > 1 + m#(S n Lq) > 1 + m = — . 

m—1 m — 1 

This completes the induction. 

We now establish Theorem [3j If any of the polynomials f vanishes identi- 
cally we may clearly remove it from the collection, and if any of the polyno- 
mials is a non-zero constant then z(i; A n (¥ q )) is empty, so that the result is 
trivial. It follows that we may assume that each of the fi has positive degree, 
whence r, which is the number of polynomials fi, is at most d. Thus the 
zero-set of the polynomials f , over ¥ q is an algebraic variety, V say, of degree 
at most d d and dimension at most n. This variety is defined over ¥ q , and 
is not necessarily absolutely irreducible. Indeed V may contain irreducible 
components which are not defined over ¥ q . We call such components "bad", 
and our goal is to replace V by a variety V* with no bad components, and 
such that V(¥ q ) = V*(¥ q ). We proceed to describe a general "reduction 
process" for our varieties. Suppose that V has at least one bad component. 
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Choose such a component, W say, of maximal dimension D say. Let the 
Galois conjugates of W be W = W±, . . . , Wk- Since each of these is an ir- 
reducible component of V we see that k < deg(V). Any F^-point of W lies 
on W := W\ fl . . . H Wk- Thus if we replace the components W\, . . . , Wk 
of V by W, and call the resulting variety V, we see that V(F q ) = V'(¥ q ). 
Moreover V contains no bad components of dimension greater than D, and 
since dim(W') < dim(VT) there is one fewer bad component of dimension D. 
We also note that 

deg(W') < deg(W^) fc < deg(V) dcs{v) 

whence 

deg(y') < deg(V) + deg(\/) dcg(v ' ) . 

We now repeat the reduction process, passing from V to V" and so on. Since 
the number of bad components of maximal dimension is reduced at each step, 
while the degree remains under control, the process will eventually terminate, 
and will produce a variety V* whose degree is bounded in terms of deg(V) 
and n. The theorem then follows. 
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